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The occurrence of digits 1 through 9 as the leftmost nonzero digit of numbers from real-world sources 
is distributed unevenly according to an empirical law, known as Benford’s law or the first digit law. It 
remains obscure why a variety of data sets generated from quite different dynamics obey this particular 
law. We perform a study of Benford’s law from the application of the Laplace transform, and find that the 
logarithmic Laplace spectrum of the digital indicator function can be approximately taken as a constant. 
This particular constant, being exactly the Benford term, explains the prevalence of Benford’s law. The 
slight variation from the Benford term leads to deviations from Benford’s law for distributions which 
oscillate violently in the inverse Laplace space. We prove that the whole family of completely monotonic 
distributions can satisfy Benford’s law within a small bound. Our study suggests that the origin of 
Benford’s law is from the way that we write numbers, thus should be taken as a basic mathematical 
knowledge. 

© 2019 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license 

(http://creativecommons.Org/licenses/by/4.0/). 


1. Introduction 

There is an empirical law concerning the occurrence of the first 
digits in real-world data, stating that the first digits of natural 
numbers prefer small ones rather than a uniform distribution as 
might be expected. More accurately, the probability that a num¬ 
ber begins with digit d, where d — 1 , 2 ,..., 9 respectively, can be 
expressed as 

Pd = logioO + J), d = 1,2, ..., 9 , (1) 

as shown in Fig. 1. This is known as Benford’s law, which is also 
called the first digit law or the significant digit law, first noticed 
by Newcomb [1] in 1881, and then re-discovered independently by 
Benford [2] in 1938. 

Empirically, the areas of lakes, the lengths of rivers, the Ara¬ 
bic numbers on the front page of a newspaper [2], physical con¬ 
stants [ 3 ], the stock market indices [ 4 ], file sizes in a personal com¬ 
puter [ 5 ], survival distributions [ 6 ], etc., all conform to this peculiar 
law well. Due to the powerful data analyzing tools provided by 
computer science, Benford’s law has been verified for a vast num- 
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Fig. 1 . Benford’s law of the first digit distribution, from which we see that the prob¬ 
ability of finding numbers with leading digit 1 is more than 6 times larger than that 
with 9. 

her of examples in various domains, such as economics [7,8], social 
science [6], environmental science [9], biology [10], geology [11], 
astronomy [12], statistical physics [13,14], nuclear physics [15-17], 
particle physics [18], and some dynamical systems [19,20], There 
have been also many explorations on the applications of the law 
in various fields, e.g., in upgrading the description in precipita¬ 
tion regime shift [21], Some applications focus on detecting data 
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and judging their reasonableness, such as distinguishing and as¬ 
certaining fraud in taxing and accounting [22-25], fabrication in 
clinical trials [26], the authenticity of the pollutant concentrations 
in ambient air [9], electoral cheats or voting anomalies [5,27], and 
falsified data in scientific experiments [28], Moreover, the first 
digit law is applied in computer science for speeding up calcu¬ 
lation [29], minimizing expected storage space [30,31], analyzing 
the behavior of floating-point arithmetic algorithms [31], and also 
for various studies in the image domain [32,33], 

Theoretically, several elegant properties of Benford’s law have 
been revealed. In mathematics, Benford’s law is the only digit law 
that is scale-invariant [34,35], which means that the law does 
not depend on any particular choice of units. This law is also 
base-invariant [36-38], which means that it is independent of 
the base b. In the octal system (b = 8), the hexadecimal system 
(b = 16), or other base systems, the data, if fit the law in the deci¬ 
mal system (b = 10), all fit the general Benford’s law 

P d = logfcd + J), d = l,2. b- 1. (2) 

a 

The law is also found to be power-invariant [18], i.e., any power 
(^ 0) on numbers in the data set does not change the first digit 
distribution. 

There have been many studies on Benford’s law with numerous 
breakthroughs. For example, Hill provided a measure-theoretical 
proof that Benford’s law is equivalent to the scale-invariant prop¬ 
erty and that random samples taken from randomly-selected dis¬ 
tributions converge to Benford’s law [37,38]. Pietronero et al. ex¬ 
plained why some data sets naturally show scale-invariant proper¬ 
ties from a dynamics governed by multiplicative fluctuations thus 
conform to Benford’s law [39], Gottwald and Nicol figured out that 
deterministic quasiperiodic or periodic forced multiplicative pro¬ 
cess and even affine processes also tend to Benford’s law [40], 
Engel and Leuenberger focused on exponential distributions and 
illustrated that they approximately obey Benford’s law within a 
bound of 0.03 [41]. Smith applied digital signal processing and 
studied the distributions on the logarithmic scale and their fre¬ 
quency domain, revealing that the first digit law holds for distri¬ 
butions with no components of nonzero integer frequencies [42], 
Fewster asserted that any distribution might tend to Benford’s 
law if it can span several orders of magnitude and be reasonably 
smooth [43]. 

However, there are still various data sets that violate Benford’s 
law, e.g., the telephone numbers, birthday data, and accounts with 
a fixed minimum or maximum. Benford’s law still remains obscure 
whether this law is merely a result of our way of writing numbers. 
If the answer is yes, why not all number sets obey this law; if the 
answer is no, why is this law so common that it can be a good ap¬ 
proximation for most data sets. The situation can also be reflected 
by some puzzles about Benford’s law in the literature, e.g., it is 
stated by Tao that no one can really prove or derive this law be¬ 
cause Benford’s law, being an empirically observed phenomenon 
rather than an abstract mathematical fact, cannot be “proved” the 
same way a mathematical theorem can be proved [44], Aldous 
and Phan also suggested that without checking the assumptions 
of Benford’s law for the data sets we studied, this logically correct 
mathematical theorem is not relevant to the real world [45], 

Therefore, most studies on Benford’s law are case studies in 
literature, restricted to a specific probability density distribution 
or a group of them. In this work, we provide a general deriva¬ 
tion of Benford’s law with the application of the Laplace trans¬ 
form, which is an important tool of mathematical methods in 
physics [46], From our derivation, we can safely assert that the 
deviation from Benford's law is always less than a small propor¬ 
tion of the L^norm of the logarithmic inverse Laplace transform 


of the probability density function. This bound is universal. Since 
the L'-norm of the logarithmic inverse Laplace transform is usually 
small but not zero, Benford’s law is commonly well obeyed but not 
strictly obeyed. We introduce a guideline to judge how well a spe¬ 
cific distribution obeys Benford’s law. In this method, the degree of 
deviation from the law is associated with the oscillatory behavior 
of the probability density function in the inverse Laplace space. We 
find that the whole family of completely monotonic distributions 
can all fulfill Benford’s law within a small bound. We also carry out 
some numerical estimations of the error term, and present several 
examples which verify our method. We agree with Goudsmit and 
Furry [47] and reveal from our own method that the appearance 
of the first digit law is a logical consequence of the digital system, 
but not due to some unknown mechanics of the nature. 

Our work is organized as follows. In Sec. 2 we introduce the 
digital indicator functions for the given digital system and put for¬ 
ward an intuitive explanation of Benford’s law by revealing the 
heterogeneity of such functions among different first digits. In 
Sec. 3 we apply the Laplace transform to the digital indicator func¬ 
tions to reveal their elegant properties. From these, in Sec. 4 we 
provide a general derivation of a strict version of Benford’s law 
and prove that the strict Benford’s law is composed of a Benford 
term and an error term. In Sec. 5 we study the error term by ap¬ 
plying our general result to four categories of number sets, which 
obey Benford's law to varying degrees. Especially, we prove that 
completely monotonic distributions can satisfy Benford’s law well. 
Numerical studies are also provided to verify our method. Sec. 6 is 
reserved for conclusions. 

2. The intuition 

Let F(x ) be an arbitrary normalized probability density function 
(PDF) defined on the positive real number set M + (here we use the 
capital letter F instead of the lowercase one, due to conventions 
for the Laplace transform introduced in Sec. 4). It does not matter 
if negative data are allowed, for we can instead use the PDFs of 
their absolute values. 

In the decimal system, the probability P d of finding a number 
with first digit d is the sum of the probability that it is within the 
interval [d-10 n , (d + l)-10 n ) for an integer n, therefore P d can be 
expressed as 


(d+i)-io n 


P d = 


^ n 

E / 

1 -- " 


F (x )dx, 


d-10" 

which can also be rewritten as 


(3) 


OO 

Pd = J F(x)g d (x) dx, (4) 

o 

where g d (x) is the digital indicator function (DIF), indicating num¬ 
bers with first digit d in the decimal system (here the lowercase 
letter is used, also due to conventions of the Laplace transform). 
Using the notation of the Heaviside step function, 


?l(x) 


1, ifx> 0, 
0, ifx<0, 


we can write g d (x) as 


(5) 


(X) 

gd(x)= [^7( x — d-10 n ) — ?7(x — (d + l)-10 n )]. (6) 

n=—oo 

Different first digits define different g d (x) functions, thus be¬ 
have differently in the digital system. For a better illustration, we 
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Fig. 2. Images of digital indicator functions gi (x) and g 2 (x). Neither of them can be 
a translation or an expansion of the other. 


draw the images of gi(x) and g 2 (x) in the interval [1,30), as 
shown in Fig. 2. We notice that g 2 (x) can be neither a translation 
nor an expansion of gi (x), and that the gap between the shaded 
areas in g 2 (x) is wider than that in gi(x). This fact intuitively ex¬ 
plains the inequality among the 9 digits, where smaller leading 
digits are more likely to appear. 

Furthermore, if drawn on the logarithmic scale, gd(x) becomes 
a periodic function with a mean value of log 10 (l + J). This gives 
us the intuition why gd(x) has a strong connection with Benford’s 
law. In the following sections, through strict mathematical deriva¬ 
tions, we verify our intuition and show that the Benford term 
comes exactly from gd(x). 

3. The Laplace transform of the digital indicator function 

In this section, we study the Laplace transform of the digi¬ 
tal indicator function (DIF) and show that the transformed DIF is 
also a log-periodic function which frequently appears in various 
systems [48], and exhibits some elegant properties that indicate 
Benford’s law. For general cases, we can define the DIF under 
base-b as g b jj(x), whose value is 1 for numbers within the in¬ 
terval [d-b n , (d + l)-b n ) for some integer n and 0 otherwise, i.e„ 

OO 

gb,d, i(x)= [i)(x-d-b n )-ri(x-(d + l)-b n )]. (7) 

n =—oo 

The Laplace transform of this general DIF is defined as 

oo 

Cb,d,i(t) = J gb,d,i(x)e~ tx dx. (8) 

0 

We turn to the logarithmic scale again and further define 

Hb,d,l(t) = tCb,d,l(t), 

~ 9 

H b .d,i( s ) = H b , d j(e s ) ■ 

The properties of Hb,d,i(s) are given as follows: 


H b ,d.l( s ) = 

OO / 

Y\ ( exp[—d-b < -" + fi'6- ) ] 


exp [—(d + l)-b^ n+ 



( 10 ) 


For the second property, we have the mean value of Fl b d,i(s) 
within [0, lnb) as 


lnf> 


( Ht> 'd j(S) ) = h \b J ” bM(S) 


ds 


= -^ J ^exp[—d-e s ] — exp[-(d + l)-e s ] ) ds 


i r i 

lnb 


/tC 


- i e dt - e~ (d+l)t ) 


dt 


o 


= log b (l +-). 


( 11 ) 


With these two properties, it is straightforward to rewrite 
H bMJ (s) as 


HM./(s) = lo&(l + i) + A MiI (s), (12) 

d 

where A b ,d,i( s ) represents the periodic fluctuation of Fl b d,i(s) 
around its mean value. It is noted here that Fl bt d,i(s) is the log¬ 
arithmic Laplace spectrum of the DIF. Therefore, it is independent 
of any particular distributions of number sets. 

The first term \og b (\ + j) is here called the Benford term and 
we will show in Sec. 4 that it is the origin of the classical Benford’s 
law, while A b b j(s) is responsible for the possible deviation from 
the law. We will show in Sec. 5 that this deviation is small for a 
big family of distributions. 

It is worth noting that the Benford term is derived merely from 
the DIF of a certain digital system without assuming the exact 
form of the PDF. Therefore, we assert that the origin of Benford’s 
law comes from the way that the digital system is constructed, in¬ 
stead of the way that some specific number set is formed. 


4. The derivation of the general digit law 


We see that the logarithmic Laplace spectrum of the digital 
indicator function fluctuates around the Benford term. Another 
reason why we choose the Laplace transform is that the inverse 
Laplace transform can be served as a method to judge how well a 
specific PDF obeys the law, as well as to derive the general digit 
law. For an arbitrary PDF F(x), we can assume that it has an in¬ 
verse Laplace transform f (t) which belongs to L 1 ®" 1 "), satisfying 

OO 

F(x) = J f(t)e~ tx dt. (13) 

o 

The probability that a number drawn from a data set with a 
PDF F(x) is within the set Un^-oo^- d + l) xb n can be expressed 
as 


1. Fl b d,i(s) is periodic with period lnb; 

2. the mean value of FI b jj(s) within any single period is 
l°gb(l + j)- 

The first property is obvious by expanding FI b j,i(s) from 
Eqs. (8) and (9), i.e., 


Pb.d.l 


OO 

J F(x)gb,d,i(x) dx. 
0 


We turn to the logarithmic scale again and define 


7 (s) = /(e s ), 


(14) 


(15) 
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then f(s) also satisfies the normalization condition, i.e., 


oo oo oc 

J /(s)ds = J dt — J F(x) dx=l. 


(16) 


According to the property of the Laplace transform, Eq. (14) can be 
rewritten in the inverse Laplace space of the PDF as 

OO OO 

J F(*)gM,/(*) dx= J /(t)Gb,d,/(t) dt 


0 


oo 

= J /(s)H M ,,(s)ds. 

—OO 


(17) 



s 


Fig. 3. The image of the lnlO-periodic function Aio,i,i(s) for the first digit 1 in the 
decimal system. 

5. The error term 


Combining the expression of H bd d (s) in Eq. (12) and the nor¬ 
malization condition of /(s) in Eq. (16), we derive the strict form 
of Benford’s law, which is composed of a Benford term and an er¬ 
ror term, as follows, 

OO 

Pb,d,i = lo SbC l + ^)+ J 7(s)A M ,/(s)ds. (18) 

—OO 

Since A bdd (s) is slightly fluctuating, the error term is small for 
most circumstances, as we intend to illustrate further in Sec. 5. if 
we ignore the error term in Eq. (18), the strict Benford’s law turns 
into the general digit law, i.e., 

+ ( 19 ) 

Lots of variations of the classical Benford’s law can be seen as 
corollaries of the general digit law. For example, the base fa can be 
set to 100 to derive the second significant digit law given by New¬ 
comb [1], A number (x)io in the decimal system can be equally 
treated as a number (x)ioo in the base-100 system, so that the 
second digit of (x)io being d is equivalent to that either the first 
“digit” of (x )too belongs to the set = {10+d, 20+d, • • ■ , 90+d}, 
or that the first “digit" of (lOxhoo belongs to the same set S d . 
Therefore, we have 


P(2nd digit of (x)io = d) 

= P(lst “digit” of (x)ioo G Sd) + 

P(lst “digit” of (10x)ioo g S d ) 

9 9 

* E lo Si°o ^ + tofc + d) + E logl0 ° ( 1 + lOkTd) 

/<=1 x 7 k = 1 x 7 

= £log 10 (l + T ^L-). (20) 

/<=1 v 7 

Similar reasoning can also be applied to the lth-significant 
digit law of Hill [38]: letting D; (Di,D 2 ,...) denotes the ith- 
significant digit (with base 10) of a number (e.g., Dt (0.0314) = 3, 
D2(0.0314) = 1, 03(0.0314) = 4), then for all positive integers k 
and all dj e 0,1, ■ ■ ■ ,9, j = 1 ,2, ■ ■ ■ ,k, one has 


P(D t =dj. 


, D k = d k ) « 


logio 




( 21 ) 


In this section, we introduce a method to judge how well a 
certain PDF obeys the classical Benford’s law by analyzing the total 
error term in Eq. (18), which is the interrelation of A b , dd (s) and 
7(s), i.e., 

OO 

Atotal ,b.d,l= J 7(s)A b , d ,/(s)ds. (22) 

—OO 

We know in Sec. 4 that A bdd (s) is a In fa-periodic function with a 
mean value of 0. For instance, a graph of Aio,i,i(s) is shown in 
Fig. 3. The amplitude of this periodic function is small compared 
with the Benford term, e.g., the amplitude of Aio,i,i(s) is less than 
0.03 while the Benford term is 0.30. Therefore, intuitively speaking, 
if / (s) is smooth enough and changes slowly, its interrelation with 
A b , dd (s) tends to be averaged out; thus the total error tends to be 
small. On the other hand, if /(s) oscillates violently, the interre¬ 
lation is highly sensitive to the exact form of /(s), and the total 
error is likely to be large. 

Rigorously, we can classify real-world number sets into the fol¬ 
lowing four categories, with different degrees of deviation from the 
classical Benford’s law. 

1. The error term equals to the constant 0 for scale-invariant dis¬ 
tributions. 

2. Since the error term is bounded by a proportion of the 
i'-norm of f(s) (noted by ||/||i), i.e., 

Atotai ,b,dJ £ [\\7h miniA b , d ,/}, ll/lli max{A b . d ,,j], (23) 

when f (s) oscillates mildly between positive and negative val¬ 
ues, ||/||i is close to 1, thus the error term is small. 

3. Specifically, if f(s)>0 holds for Vs el, ||/||i reaches its min¬ 
imum value 1, so the bound is the tightest, i.e., 

Atotal.M,/ e [min{ A b d /}, max{A Md }]. (24) 

Such distributions are called completely monotonic distribu¬ 
tions. ~ 

4. When f(s) oscillates dramatically, its L’-norm becomes large, 
so the error term becomes uncertain and the classical Ben- 
ford's law is generally violated. 

We prove and explain the above assertions one by one in the fol¬ 
lowing sections. Some numerical examples are provided for better 
illustration. 

5 . 1 . Scale-invariant distribution 

We first turn to scale-invariant distributions which are initially 
discussed by Hill on the prospect of the probability measure the- 
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ory [37]. Pietronero et al. [39] has shown that scale invariant dis¬ 
tributions arise naturally from any multiplicative stochastic process 
such as the dynamics of stock prices. With Laplace transform, we 
can also show that scale invariance leads to Benford’s law. 

Following the definition of Hill [37], a scale-invariant probabil¬ 
ity measure P is a measure defined on the following a -algebra 


*= U 


B x b n 


for some Borel B c [1, b), (25) 


satisfying P(S) = P(\S) for all A. > 0 and S e ,M. Hill proved that 
such a scale-invariant measure strictly satisfies Benford’s law. We 
can easily prove this result again with the language of PDF, if 
we set S to be UJji-ooW’ d + 1) x b n and notice that the scale- 
invariance property implies the following statement, i.e„ for all 
tel, 


oo oo 

J 7(s)A M ./(s)ds= J 7(s)A M ,/(s + e)ds = C (26) 

—OO —OO 

holds, where C is independent of e. According to the periodic and 
zero-mean properties of Ab,d./( s ), we have 


lnb co 


J ( J 7(s)A MJ (s + e)ds^ de 


0 — oo 


oo /In b \ 

J J(s) I j A M ,,(s + e)de J ds 


-oo \ 0 


J 7(s)0 


• ds 


= 0 . 


(27) 


Thus, we get /g nb C ds = 0, i.e., C = 0. We notice that C is also the 
error term in Eq. (22). Therefore, such scale-invariant distributions 
conform strictly to the classical Benford’s law. 


5.2. Small- 1| /||i distribution 


Table 1 

Numerical results of the Benford term P® 0 d 1 and the maximum of the absolute 
error term max{|Af,,d,/|} in the decimal system, together with the relative errors 


d 

1 

2 

3 

4 

5 

6 

7 

8 

9 

A 1(W,1 / % 
omax lo/ 
°10,d,l /* 

30.10 

2.97 

9.9 

17.61 

I. 94 

II. 0 

12.49 

I. 41 

II. 3 

9.69 

1.11 

11.4 

7.92 

0.91 

11.5 

6.69 

0.76 

11.5 

5.80 

0.77 

11.6 

5.12 

0.59 

11.6 

4.58 

0.53 

11.7 


Benford’s law holds well, with a maximum relative error of 12 * 
||/||i% for all digits. 

The exponential distribution is a good example of small-1|/II i 
distributions. 

F(x) = Xe~ lx (X> 0). (29) 

Engel and Leuenberger showed that the exponential distribution 
obeys Benford’s law approximately within bounds of 0.03 (for b = 
10 and d — l— 1) [41], This fact can be explained by Eq. (24) if 
we notice that the logarithmic inverse Laplace transform of the 
exponential distribution is f(s) = S(s — In A.). Therefore, ||/||i = 1, 
so 

A total, to, i,i 6 [min{ Am.i.i}, max{A 10 ,i,ij] 

C (-0.03, 0.03). 

The log-normal distribution with a big variance is another ex¬ 
ample. The PDF is 

1 (lnx-/r) 2 

F(x) = -— e 2 a 2 . (31) 

xffV2ff 

As long as a is not too small relative to the base b, f ( s ) oscil¬ 
lates mildly between positive and negative values, so ||/||i is also 
considerably small. For example, when /x = 5 and a — 1, 

OO 

ll/lli = f |7(s)|ds = 1.610. (32) 

—OO 

Then, from Eq. (23) we have 


Although scale invariance is common in nature, not all natural 
data sets are scale invariant. Even for those data sets which are not 
scale invariant, Benford’s law can still be a good approximation for 
most cases. This is because the error term in Eq. (22) can be well 
bounded by the I 1 -norm of f, as is shown in Eq. (23). 

The proof is from the fact that ||/||i is an upper bound of the 
integral of a function. According to Eq. (22), we have 


A total, to, 1,1 G [ll/lli min{Aio,i,i), ||7lli maxjAio.i.i}] 

C (-0.048. 0.047), 

which is also acceptable compared to the Benford term 0.301. 
5.3. Completely monotonic distribution 


ds 


| 1 min{A b , d ,/} = min{A b , d ,/} / |7(s)| 

—oo 

— Atotal,b,d,/ 

oo 

< max{ A b d /} J |7(s)|ds 

—OO 

= ll7llimax{A b , dJ }, 


(28) 


where ||/||i = |/(s)|ds is the L 1 -norm of f(s). 

In the decimal system, we numerically calculate the Benford 
terms (noted by P® od ^) and the maximum value of |A b d /| (noted 
by ,) in Table 1. The relative errors <5™ , = A™* .,/P^^ 
are also listed. From Table 1, we notice that when ||/||i is small, 


The family of completely monotonic (c.m.) distributions is a 
special case of small-||/||i distributions. Completely monotonic 
distributions are probability distributions with c.m. PDFs. This is 
equivalent to say that f(s) is non-negative for all set. In this 
case, || / ||i reaches its minimum value 1 due to the normalization 
condition in Eq. (16), thus we get the tightest bound in Eq. (24). In 
fact, the exponential distributions and the scale invariant distribu¬ 
tions that we have discussed above are both c.m., but the family of 
c.m. functions are much more prosperous. Miller and Samko [49] 
surveyed a series of good properties of completely monotonic func¬ 
tions. Herein we summarize these properties again for the conve¬ 
nience of the readers. 

1. A function F with domain (0, oo) is said to be c.m. if it 
possesses derivatives F (n> (x) for all n = 0,1,2,3,... and if 
(—l) n F (n) (x) > 0 for all x > 0. 
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2. (Bernstein-Widder theorem) F(x) is c.m. if and only if F(x) is 
the Laplace transform of a non-negative measurable function 
/(t): t —> [0, +oo) and F(x) < +00 for 0 < x < + 00 . 

3. The following elementary functions are all c.m. functions: 


e 


—ax 


a > 0, 


1 

(a + cx) a ’ 

In (a + -), 
x 


a, c, a > 0, 
a > 1, c > 0. 


(34) 


4. If F(x) is c.m., then e fw is also c.m. 

5. If Fi(x) and F 2 (x) are c.m., then aFi(x) + cF 2 (x), a > 0, c > 0 
is also c.m. 

6. If Fi(x) and F 2 (x) are c.m., then Fi(x)F 2 (x) is also c.m. 

7. Let F(x) be c.m. and let t(x) be nonnegative with a c.m. 
derivative, then F(r(x)) is also c.m. 


From Properties 4, 5, 6, 7, we can generate a large family of 
c.m. functions from elementary c.m. functions in Property 3. For 
such a c.m. function F(x) to be a valid PDF, it should also satisfy 
the normalization condition in Eq. (16). When 


OO 

J F(x) dx < 00 , (35) 

0 


the normalization condition can be guaranteed by introducing a 
normalization factor. 

Several examples of c.m. PDFs are listed below. The parame¬ 
ters are thus chosen so that the integral in Eq. (35) converges. The 
normalization factors are omitted in Eq. (36). Distributions gener¬ 
ated from these PDFs all satisfy Benford’s law within a very small 
bound: 


e -a(x+c)“ 
e a(x+c)“ _ y , 


1 

(X + C)“ ’ 

— e-™*, 
x v 

g—a(lnx+c)“ 


a, c > 0, 0 < a < 1, 
a, c > 0, a < 0, 

c > 0, a > 1, 

a > 0, 0 < a < 1, 0 < v < 1, 
a, c > 0, 0 < a < 1. 


(36) 


Some non-c.m. distributions can be converted to c.m. distribu¬ 
tions through a non-linear transformation of the data. Literature 
has shown that non-linear transformations on some data sets yield 
more robust results when Benford’s law is used to detect fraud 
[50]. To explain this, suppose x is a random variable with PDF F(x), 
if we transform x into y = r(x), then the PDF of y becomes 


fl F(r k ~'(y» 
hi \^r k -\y))\’ 


(37) 


where n(y) is the number of solutions in x for the equation r(x) = 
y and (y) is the fcth solution. 

One example of this case is the normal distribution with the 
PDF 


1 (*-/ 0 2 

F(x)= e 2,2 . (38) 

V2na 2 

Through the transformation y = (x — /z) 2 , the PDF becomes 
F(y)= 1 e~5fr, y > 0. (39) 

yJ2jio 2 y 



Fig. 4. The images of log-normal PDF F(x) with a = 1.0, 0.5 and 0.3, with // = In 5 
fixed. 
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Fig. 5. The images of Aio.i.ifs) together with /(s) for the log-normal distribution 
with a = 1.0, 0.5 and 0.3. Noted that /(s) displays stronger oscillatory behavior 
when a decreases. 


Eq. (39) is c.m. Therefore, the transformed data set of y fulfills 
Benford’s law with an error bound of 0.03, same to that of expo¬ 
nential distributions. 

5.4. Violently-oscillating-f ( s) distribution 

The only case in which Benford’s law loses its power is when 
f (s) oscillates violently between positive and negative values. The 
fast oscillation of /(s) makes the small term of Ab.d.i be counted 
and accumulated again and again. Hence, ||/||i becomes large, and 
f(s) is highly sensitive to some tiny perturbation on F(x), reflect¬ 
ing the instability of the inverse Laplace transform [51]. Therefore, 
the total error is also highly sensitive to the exact form of F(x) and 
Benford’s law is generally violated in this case. 

Examples of such violently-oscillating-/(s) distributions are 
log-normal distributions with small a and uniform distributions. 
We have shown that the log-normal distribution with parameters 
/z = In 5 and a — 1.0 approximately conforms to the classical Ben¬ 
ford’s law. However, when a becomes smaller, the distribution is 
concentrated on some specific first digits, as shown in Fig. 4 for 
a = 1.0, 0.5, 0.3. In the logarithmic inverse Laplace space, we nu- 
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Fig. 6. Pio.i.i of the log-normal distribution compared with the Benford term 
P®o ] , under 0 < cr < 1.5. 


Fig. 8. Pio.i.i of the uniform distribution compared with the Benford term Pf 0 , 1 
under 0 < a < 50. 
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Fig. 7. The images of Aio,i,i(s) together with /(s) for the uniform distribution with 
a = 10, 20 and 30. Be noted that f(s ) functions oscillate much more fiercely than 
those in Fig. 5. 


the endpoints of the PDFs, occasionally coincides with the Benford 
term but generally violates the classical Benford’s law. This is again 
expected. 

At last, we want to rectify two typical misunderstandings about 
Benford’s law, i.e., 

1. random data sets without human manipulation are supposed 
to fulfill Benford's law; 

2. smooth distributions that span many orders of magnitude 
should satisfy Benford’s law. 

Unfortunately, however, neither of these two assertions are correct. 

As we have shown, the first statement is only approximately 
true for random variables generated from PDFs with small ||/||i. 
Even natural random data sets could violate Benford’s law if their 
PDFs oscillate violently in the inverse Laplace space. For example, 
lots of natural number sets are distributed normally or lognormally 
near their mean values with very small variances, such as heights 
of all trees on the earth. Such data sets, although natural, do not 
obey Benford’s law. 

As for the second statement, whether or not a distribution sat¬ 
isfies Benford’s law is determined by the shape, instead of the 
scale, of the PDF. Therefore, even an extremely flat PDF which 
spans many orders of magnitude may still violate Benford’s law. 
To understand this, we note that one can change the scale of any 
PDF Fj (x) by multiplying the original data with an arbitrary num¬ 
ber a. The PDF of the new data set is 


merically calculate f (s) by the Stehfest method [52-54] and plot 
them together with Aio.i.i(s) in Fig. 5. We notice that /(s) dis¬ 
plays stronger oscillatory behavior as a decreases. Thus the in¬ 
terrelation between Aio,i,i(s) and /(s) is highly sensitive to the 
exact form of /(s), or some tiny perturbation on F(x). 

In fact, we can numerically calculate Pio.i.i directly from 
Eq. (4) for 0 < a < 1.5 and the results are shown in Fig. 6. This 
verifies our prediction. When a becomes smaller, f(s) oscillates 
stronger and P 10 , 1,1 deviates further from the Benford term. 

For the uniform distribution on the interval [1, a] (a > 1), f(s) 
oscillates even stronger. The PDF is given by 

Fix) = . (40) 

a — 1 

We use an analytic function to approach F(x), and calculate the 
numerical values of the inverse Laplace transform for a = 10, 20 
and 30, as shown in Fig. 7. Since /(s) is extremely unstable for 
such distributions, we can expect the total error is unstable as well, 
generally large. 

Also for verification, we draw the numerical values of Pio.i.i 
under 1 < a < 50 in Fig. 8. Pio.i.i in this case depends greatly on 


F 2 W = -Fi(-). (41) 

a a 

When a turns bigger, F 2 (x) is flattened out, and it can span as 
many orders of magnitude as we desire. However, the logarithmic 
inverse Laplace transform of Fjfx) and F 2 (x) differ only by a hori¬ 
zontal shift, i.e., 

? 2 (s) = /i(s + ln(a)). (42) 

Such a horizontal shift does not change the unstable nature of the 
error term in Eq. (22). 

If we desire to reduce the error term, we need to flatten out 
/i(s) directly, e.g., into / 2 (s) = When a becomes bigger, 

the total error of Benford’s law in Eq. (22), as the interrelation be¬ 
tween a periodic function and an extremely flat / 2 (s), tends to 
vanish. In this case, the shape of the PDF Fi(x) has been changed. 
In fact, when a is big enough and /r(l) i=- 0 (this can be guaran¬ 
teed up to a scaling factor in Eq. (41)), F 2 (x) approaches to the 
scale invariant distribution, i.e., 










































M. Cong et al. / Physics Letters A 383 (2019) 1836-1844 


1843 


F 2 (x) _ C^hahe-^dt 

F 2 ( cx) / 0 °° X / 1 (t' a )e- ctx dt 

= /o°° h(the~ tx dt (43) 

- }fo°°M(c^) e - tx dt 
Q '^ +0 °> c, forVc>0. 

pointwise 

Such an operation brings Benford’s law back to power again be¬ 
cause the shape of the original PDF, not only the scale, has been 
changed. 

6. Summary 

The first digit law has revealed an astonishing regularity of nat¬ 
ural number sets. We introduce a method of the Laplace transform 
to study the law in depth. Our method can explain the long¬ 
standing puzzle about Benford's law, i.e., whether or not Benford’s 
law is merely a result of the way of writing numbers. Our answer 
is yes in the sense that the Benford term can be derived inde¬ 
pendently of any specific probability distributions. This does not 
conflict with the fact that when the L’-norm of the logarithmic 
inverse Laplace transform of the PDF is large, Benford’s law is al¬ 
ways violated. 

Besides, the method sets a bound on the error term, allowing us 
to predict the validity of Benford’s law by the logarithmic inverse 
Laplace transform of an arbitrary PDF. Real-world distributions can 
be categorized into four types, corresponding to their oscillatory 
behavior in the inverse Laplace space. A milder oscillation guaran¬ 
tees higher conformity to the law, and vice versa. Especially, the 
whole family of completely monotonic distributions all obey Ben¬ 
ford’s law within a small bound. Numerical examples are shown to 
verify our method. It is not strange anymore why Benford’s law is 
so successful in various domains of human knowledge. Such a law 
should receive attention as a basic mathematical knowledge, with 
great potential for vast application. 

Acknowledgements 

This work is supported by National Natural Science Foundation 
of China (Grant No. 11475006). It is also supported by National 
Innovation Training Program for Undergraduates. 

References 

[1] S. Newcomb, Note on the frequency of use of the different digits in natural 
numbers, Am. J. Math. 4 (1) (1881) 39-40. 

[2] F. Benford, The law of anomalous numbers, Proc. Am. Philos. Soc. (1938) 
551-572. 

[3] J. Burke, E. Kincanon, Benford’s law and physical constants: the distribution of 
initial digits, Am. J. Phys. 59 (10) (1991) 952. 

[4] E. Ley, On the peculiar distribution of the U.S. stock indexes’ digits, Am. Stat. 
50 (4) (1996) 311-313. 

[5] J. Torres, S. Fernandez, A. Gamero, A. Sola, How do numbers begin? (The first 
digit law), Eur. J. Phys. 28 (3) (2007) LI7. 

[6] L.M. Leemis, B.W. Schmeiser, D.L. Evans, Survival distributions satisfying Ben¬ 
ford’s law, Am. Stat. 54 (4) (2000) 236-241. 

[7] D.E. Giles, Benford’s law and naturally occurring prices in certain ebaY auc¬ 
tions, Appl. Econ. Lett. 14 (3) (2007) 157-161. 

[8] M.J. Nigrini, Persistent patterns in stock returns, stock volumes, and account¬ 
ing data in the U.S. capital markets, J. Account. Audit. Financ. 30 (4) (2015) 
541-557. 

[9] R.J. Brown, Benford’s Law and the screening of analytical data: the case of pol¬ 
lutant concentrations in ambient air, Analyst 130 (9) (2005) 1280-1285. 

[10] E. Costas, V. Lopez-Rodas, F.J. Toro, A. Flores-Moya, The number of cells in 
colonies of the cyanobacterium Microcystis aeruginosa satisfies Benford’s law, 
Aquat. Bot. 89 (3) (2008) 341-343. 

[11] M.J. Nigrini, S.J. Miller, Benford’s law applied to hydrology data—results and 
relevance to other geophysical data, Math. Geol. 39 (5) (2007) 469-490. 


[12] L. Shao, B.-Q. Ma, Empirical mantissa distributions of pulsars, Astropart. Phys. 
33 (4) (2010) 255-262. 

[13] L. Shao, B.-Q. Ma, The significant digit law in statistical physics, Phys. A, Stat. 
Mech. Appl. 389 (16) (2010) 3109-3116. 

[14] L. Shao, B.-Q. Ma, First-digit law in nonextensive statistics, Phys. Rev. E 82 (4) 
(2010) 041110. 

[15] D. Ni, Z. Ren, Benford’s law and half-lives of unstable nuclei, Eur. Phys. J. A 
Hadrons Nuclei 38 (3) (2008) 251-255. 

[16] X.J. Liu, X.P. Zhang, D.D. Ni, Z.Z. Ren, Benford’s law and cross-sections of 
A(n,G?)B reactions, Eur. Phys. J. A Hadrons Nuclei 47 (6) (2011) 78, https:// 
doi.org/10.1140/epja/i2011-11078-3. 

[17] H. Jiang, J.-J. Shen, Y.-M. Zhao, Benford’s law in nuclear structure physics, Chin. 
Phys. Lett. 28 (3) (2011) 032101. 

[18] L. Shao, B.-Q. Ma, First digit distribution of hadron full width, Mod. Phys. Lett. 
A 24 (40) (2009) 3275-3282. 

[19] A. Berger, L. Bunimovich, T. Hill, One-dimensional dynamical systems and Ben¬ 
ford’s law, Trans. Am. Math. Soc. 357 (1) (2005) 197-219. 

[20] A. Berger, Multi-dimensional dynamical systems and Benford’s law, Discrete 
Contin. Dyn. Syst. 13 (1) (2005) 219-237. 

[21] L. Yang, Z. Fu, Out-phased decadal precipitation regime shift in China and the 
United States, Theor. Appl. Climatol. 130 (1-2) (2017) 535-544. 

[22] M.J. Nigrini, A taxpayer compliance application of Benford’s law, J. Am. Tax. 
Assoc. 18 (1) (1996) 72-91. 

[23] C. Durtschi, W. Hillison, C. Pacini, The effective use of Benford’s law to assist in 
detecting fraud in accounting data, J. Forensic Account. 5 (1) (2004) 17-34. 

[24] W.K. Tam Cho, B.J. Gaines, Breaking the (Benford) law: statistical fraud detec¬ 
tion in campaign finance, Am. Stat. 61 (3) (2007) 218-223. 

[25] M.J. Nigrini, The implications of the similarity between fraud numbers and the 
numbers in financial accounting textbooks and test banks, J. Forensic Account. 
Res. 1 (1) (2016) A1-A26. 

[26] S. Al-Marzouki, S. Evans, T. Marshall, I. Roberts, Are these data real? Statistical 
methods for the detection of data fabrication in clinical trials, BMJ 331 (7511) 
(2005) 267-270. 

[27] L. Leemann, D. Bochsler, A systematic approach to study electoral fraud, Elect. 
Stud. 35 (2014) 33-47. 

[28] A. Diekmann, Not the first digit! Using Benford’s law to detect fraudulent sci¬ 
entific data, J. Appl. Stat. 34 (3) (2007) 321-329. 

[29] J.L. Barlow, E.H. Bareiss, On roundoff error distributions in floating point and 
logarithmic arithmetic, Computing 34 (4) (1985) 325-347. 

[30] P. Schatte, On mantissa distributions in computing and Benford’s law, J. Inf. 
Process. Cybern. 24 (9) (1988) 443-455. 

[31] A. Berger, T.P. Hill, Newton’s method obeys Benford’s law, Am. Math. Mon. 
114 (7) (2007) 588-601. 

[32] J.-M. Jolion, Images and Benford’s law, J. Math. Imaging Vis. 14 (1) (2001) 
73-81. 

[33] D. Fu, Y.Q. Shi, W. Su, A generalized Benford’s law for JPEG coefficients and its 
applications in image forensics, in: Security, Steganography, and Watermarking 
of Multimedia Contents, 2007, p. 65051L. 

[34] R.S. Pinkham, On the distribution of first significant digits, Ann. Math. Stat. 
32 (4) (1961) 1223-1230. 

[35] A. Berger, T.P. Hill, ICE. Morrison, Scale-distortion inequalities for mantissas of 
finite data sets, J. Theor. Probab. 21 (1) (2008) 97-117. 

[36] T.P. Hill, The significant-digit phenomenon, Am. Math. Mon. 102 (4) (1995) 
322-327. 

[37] T.P. Hill, Base-invariance implies Benford’s law, Proc. Am. Math. Soc. 123 (3) 
(1995) 887-895. 

[38] T.P. Hill, A statistical derivation of the significant-digit law, Stat. Sci. 10 (4) 
(1995) 354-363. 

[39] L. Pietronero, E. Tosatti, V. Tosatti, A. Vespignani, Explaining the uneven distri¬ 
bution of numbers in nature: the laws of Benford and Zipf, Phys. A, Stat. Mech. 
Appl. 293 (1-2) (2001) 297-304. 

[40] G.A. Gottwald, M. Nicol, On the nature of Benford’s law, Phys. A, Stat. Mech. 
Appl. 303 (3-4) (2002) 387-396. 

[41] H.-A. Engel, C. Leuenberger, Benford’s law for exponential random variables, 
Stat. Probab. Lett. 63 (4) (2003) 361-365. 

[42] S.W. Smith, The Scientist and Engineer’s Guide to Digital Signal Processing, 2nd 
edition, California Technical Pub., 1999. 

[43] R.M. Fewster, A simple explanation of Benford’s law, Am. Stat. 63 (1) (2009) 
26-32. 

[44] T. Tao, An Epsilon of Room, II: Pages from Year Three of a Mathematical Blog, 
American Mathematical Society, 2010. 

[45] D. Aldous, T. Phan, When can one test an explanation? Compare and contrast 
Benford’s law and the fuzzy CLT, Am. Stat. 64 (3) (2010) 221-227. 

[46] G.B. Arfken, H.J. Weber, F.E. Harris, Mathematical Methods for Physicists: 
A Comprehensive Guide, 7th edition, Academic Press, 2012. 

[47] S.A. Goudsmit, W.H. Furry, Significant figures of numbers in statistical tables, 
Nature 154 (3921) (1944) 800-801. 

[48] D. Sornette, Discrete-scale invariance and complex dimensions, Phys. Rep. 
297 (5) (1998) 239-270. 

[49] K.S. Miller, S.G. Samko, Completely monotonic functions, Integral Transforms 
Spec. Funct. 12 (4) (2001) 389-402. 







1844 


M. Cong etal./ Physics Letters A 383 (2019) J 836-1844 


[50] P. Clippe, M. Ausloos, Benford’s law and Theil transform of financial data, Phys. 
A, Stat. Mech. Appl. 391 (24) (2012) 6556-6567. 

[51] R. Bellman, R.E. Kalaba, J.A. Lockett, Numerical Inversion of the Laplace Trans¬ 
form, American Elsevier Publishing Co., 1966. 

[52] H. Stehfest, Algorithm 368: numerical inversion of Laplace transforms [D5], 
Commun. ACM 13 (1) (1970) 47-49. 


[53] A.H. Cheng, P. Sidauruk, Y. Abousleiman, Approximate inversion of the Laplace 
transform, Math. J. 4 (2) (1994) 76-82. 

[54] H. Hassanzadeh, M. Pooladi-Darvish, Comparison of different numerical Laplace 
inversion methods for engineering applications, Appl. Math. Comput. 189 (2) 
(2007) 1966-1981. 


